LLMs on a Shoestring: The Dynamic Cache Advantage by Arvind Sundararajan
dev.to·9h·
Discuss: DEV
💾Cache Algorithms
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·7h·
Discuss: r/LocalLLaMA
🗺️Region Inference
Semantic Dictionary Encoding
falvotech.com·4h·
Discuss: Hacker News
🗂️Type Indexing
Building High-Performance Caching in Go: A Practical Guide
dev.to·18h·
Discuss: DEV
🧠Memory Models
Building a Simple Stack-Based Virtual Machine in Go
blog.phakorn.com·12h·
📚Stack Data Structures
More hardware won’t fix bad engineering
infoworld.com·10h
🔮Branch Predictors
What Facebook's Memcache Taught Me About Systems Thinking
lorbic.com·1h·
Discuss: Hacker News
Cache-Aware Algorithms
Identifying Divergences in HW Designs For High Performance Computing Workloads (LBNL et al.)
semiengineering.com·2h
Performance
Boost Windows 11 Performance: Clear Cache for Speed and Space
webpronews.com·5h
🧠Memory Consistency
The future of microoptimization
goldenstack.net·2d·
Discuss: Hacker News
🔬Nanopasses
H100 PCIe – 1.86 TB/s memcpy roofline and 8× uplift
news.ycombinator.com·1d·
Discuss: Hacker News
🧠Memory Hierarchy
StringWa.rs on GPUs: Databases & Bioinformatics 🦠
ashvardanian.com·12m·
Discuss: r/programming
🚀Tokenizer Performance
Why some agentic AI developers are moving code from Python to Rust
developers.redhat.com·12h
Interpreter Optimization
Disaggregated Inference at Scale with PyTorch and VLLM
pytorch.org·1d·
Discuss: Hacker News
🔄Subinterpreters
Topaz_Gigapixel_AI_8.4.3.dmg
xmac.app·3h
📦Executable Size
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·15h·
🌱Minimal ML
What happens when you run a program?
dev.to·3h·
Discuss: DEV
📜Bytecode Interpreters
Intel Core Ultra 3 205 Gets Early Review
techpowerup.com·4h
Instruction Fusion
Inter-die gapfill tool claims advanced packaging breakthrough
edn.com·14h
🌐Portable Assembly
How fast do websites load from Google Search? Comparing loading methods
pawelpokrywka.com·2h·
Discuss: Hacker News
📊Profiling